Estimating Covariance for Privacy Case under Interval (and Fuzzy) Uncertainty
نویسندگان
چکیده
One of the main objectives of collecting data in statistical databases (medical databases, census databases) is to find important correlations between different quantities. To enable researchers to looks for such correlations, we should allow them them to ask queries testing different combinations of such quantities. However, when we receive answers to many such questions, we may inadvertently disclose information about individual patients, information that should be private. One way to preserve privacy in statistical databases is to store ranges instead of the original values. For example, instead of an exact age of a patient in a medical database, we only store the information that this age is, e.g., between 60 and 70. This idea solves the privacy problem, but it make statistical analysis more complex. Different possible values from the corresponding ranges lead, in general, to different values of the corresponding statistical characteristic; it is therefore desirable to find the range of all such values. It is known that for mean and variance, there exist feasible algorithms for computing such ranges. In this paper, we show that similar algorithms are possible for another important statistical characteristic – covariance, whose value is important in computing correlations.
منابع مشابه
Interval-Valued Hesitant Fuzzy Method based on Group Decision Analysis for Estimating Weights of Decision Makers
In this paper, a new soft computing group decision method based on the concept of compromise ratio is introduced for determining decision makers (DMs)' weights through the group decision process under uncertainty. In this method, preferences and judgments of the DMs or experts are expressed by linguistic terms for rating the industrial alternatives among selected criteria as well as the relativ...
متن کاملrisk assessment by integration approach of FMEA and multi criteria decision-making in the interval valued fuzzy environment: case study hydraulic pump manufacturing industry
Abstract Background and aims: Nowadays with increasing global competition, companies apply several scientific methods to identify, assess and remove potential failures in production process. The main goal of this study was identification and analysis of potential failure modes in a hydraulic pump manufacturing company by using combination of interval valued fuzzy Analytic network process (IVF-...
متن کاملA New Version of Earned Value Analysis for Mega Projects Under Interval-valued Fuzzy Environment
The earned value technique is a crucial and important technique in analysis and control the performance and progress of mega projects by integrating three elements of them, i.e., time, cost and scope. This paper proposes a new version of earned value analysis (EVA) to handle uncertainty in mega projects under interval-valued fuzzy (IVF)-environment. Considering that uncertainty is very common i...
متن کاملEstimating third central moment C3 for privacy case under interval and fuzzy uncertainty
Some probability distributions (e.g., Gaussian) are symmetric, some (e.g., lognormal) are non-symmetric (skewed). How can we gauge the skeweness? For symmetric distributions, the third central moment C3 def = E[(x − E(x))] is equal to 0; thus, this moment is used to characterize skewness. This moment is usually estimated, based on the observed (sample) values x1, . . . , xn, as C3 = 1 n · n ∑ i...
متن کاملA Multi-Criteria Analysis Model under an Interval Type-2 Fuzzy Environment with an Application to Production Project Decision Problems
Using Multi-Criteria Decision-Making (MCDM) to solve complicated decisions often includes uncertainty, which could be tackled by utilizing the fuzzy sets theory. Type-2 fuzzy sets consider more uncertainty than type-1 fuzzy sets. These fuzzy sets provide more degrees of freedom to illustrate the uncertainty and fuzziness in real-world production projects. In this paper, a new multi-criteria ana...
متن کامل